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1  Preamble  &  Introduction 

Co-PI’s  note:  The  PI,  R.  Duncan  Luce,  passed  away  in  early  August  of  this 
year  (2012).  Dr.  Luce’s  long  and  distinguished  scientific  career  remained  active 
until  close  to  his  death.  The  volumes  of  scientific  papers,  books,  and  other 
published  materials  that  he  leaves  behind  speak  of  unusually  broad,  productive, 
and  important  scientific  accomplishments.  The  many  honors  he  received  in  his 
life-time,  including  the  National  Medal  of  Science,  show  the  esteem  in  which  his 
colleagues  held  him,  and  the  many  memorial  events  being  held  to  honor  him  and 
his  work  being  held  at  the  meetings  of  several  scientific  societies  demonstrate 
the  indelible  a  mark  Dr.  Luce’s  work  has  made  on  the  scientific  areas  to  which 
he  contributed.  To  say  that  Dr.  Luce  will  be  missed  as  scientist  and  as  a  person 
is  an  understatement. 

It  has  befallen  me,  his  last  Co-PI,  to  compile  this  report.  Despite  our  15  year 
long  collaboration,  it  feels  unsettling  to  put  into  words  the  last  scientific  contri¬ 
butions  Dr.  Luce  made,  but  it  also  speaks  of  how  strong  his  intellect  remained 
to  the  end  that  there  is  more  to  report  than  can  easily  be  fit  into  this  report. 
Some  of  the  threads  of  the  research  have,  as  a  consequence  of  his  passing,  loose 
ends.  A  no-cost  extension  of  the  current  grant  was  sought  to,  in  part,  tie  some 
of  these  ends,  but  due  to  changes  in  operating  procedures  at  the  AFOSR,  such 
extension  could  not  be  obtained.  It  is  nevertheless  the  case  that  all  stated  aims 
of  the  grant  were  achieved.  Additionally,  in  an  on-going  collaboration  colleagues 
important  research  paths  on  which  Dr.  Luce  was  working  will  continued  to  be 
pursued,  which  include  several  extending  from  the  work  reported  here.. 

This  final  report  of  the  Grant  describes  what  has  been  predicted  and  what 
has  been  accomplished  empirically  during  the  two  years  of  its  duration.  Some 
of  the  articles  resulting  from  this  grant  were  begun  during  a  no-cost  extension 
of  NSF  grant  BCS-0720288.  The  relevant  articles  are  denote  with  an  *  in  the 
in  the  list  of  article  resulting  from  this  grant. 

2  Scientific  Objectives  of  Research 

An  abiding  problem  of  scientific  psychological  is  how  to  measure  the  not  directly 
observable  sensation  that  intervenes  between  physical  stimuli  and  behavioral 
responses.  The  trust  of  past  efforts  has  involved  collecting  numerical  responses 
from  respondents  and  fitting  these  data  to  certain  mathematical  functions — 
e.g.,  logarithm  or  power.  This  approach  to  measurement,  while  of  much  value, 
has  some  well-documented  limits  of  generalization  within  and  across  domains 
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and  for  being  lacking  predictive  power  beyond  specialized  situations  within  a 
domain.  The  PI,  independently  and  in  collaborations,  focussed  substantial  part 
of  his  career  on  theoretical  foundations  for  measurement.  This  foundational 
work  progressed  considerably  in  the  latter  half  of  the  last  century,  but  remained 
unmatched  by  empirical  applications.  Such  empirical  applications  became  a 
focus  of  the  Pi’s  research  in  the  last  decade,  where  he  applied  measurement 
theoretical  results  to  utility  theory  on  based  on  that  work,  to  psychophysics.  The 
work  has  progressed  to  develop  behavioral  equivalents  to  certain  mathematical 
forms  of  the  psychophysical  function  and  of  how  respondents  behave  in  tasks 
aimed  at  measuring  the  sensations  of  interest.  The  overacting  objective  of  this 
grant  was  to  continue  this  effort.  The  grant  proposal  listed  eight  main  objectives, 
all  of  which  have  been  achieved.  Conceptually,  the  plan  that  has  been  executed 
had  three  main  thrusts:  expanding  the  existing  theory,  derive  from  the  theory 
accounts  for  several  outstanding  psychophysical  problems,  and  to  carry  out 
empirical  evaluations.  More  specifically: 

Theoretical  development  and  expansion:  Extend  the  theory  from  binary 
(e.g.,  two  ears)  senses  to  all  intensive  dimensions  (e.g.,  unary  taste).  Incor¬ 
porate  two  or  more  dimensions  of  a  perceptual  domain  in  a  single  account 
(e.g.,  brightness  when  hue  or  saturation  vary).  Establish  a  cross  modal 
description  that  can  a)  establish  the  degree  to  which  all  subjective  in¬ 
tensities  are  special  cases  of  a  single  ratio  scale  of  subjective  intensity; 
b)  and,  to  the  degree  to  which  that  is  not  the  case,  to  establish  a  single 
unifying  framework  for  the  interrelation  of  all  intensive  perceptions.  Indi¬ 
vidual  perceptions  differ,  wherefore  the  theory  establishes  on  the  level  of 
modeling  precisely  how  they  differ  and  relate. 

Derived  predictions  from  theory:  Studied  are:  The  distorting  perceptual 
effects  of  temporal  and/or  spatial  order  of  stimuli  presentation;  the  so- 
called  regression  effect;  the  Torgerson’s  conjecture  that  people  do  not  dis¬ 
tinguish  subjective  ratios  and  differences;  scale  properties  obtained  by  the 
ratings  paradigm. 

Empirical  evaluations:  The  extensive  (favorable)  evaluation  of  the  theory 
for  loudness  and  pitch  has  been  extended  to  brightness.  Further  empirical 
evaluation  will  complete  the  work  for  brightness  and  hue  as  well  as  include 
perceived  contrast  and  thus  establish  the  theory  as  a  unifying  description 
for  all  three  domains.  Empirical  predictions  associated  with  the  Torger¬ 
son’s  conjecture  and  measurement  scales  of  ratings  are  carried  out.  The 
theoretical  work  extends  over  the  duration  of  the  grant,  wherefore  not  all 
the  associated  empirical  work  could  logistically  be  carried  out.  However, 
in  all  instances,  the  empirical  program  is  developed  in  detail. 


3  Technical  Approach 

Our  approach  has  attempted  to  overcome  the  much  criticized  lack  of  testable, 
foundational  assumptions  in  psychology,  by  utilizing  the  theoretical  framework 
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of  axiomatic  algebra.  The  method  involves  the  formulation  of  necessary  and 
sufficient  mathematical  invariances  (axioms)  that  together  are  sufficient  to  con¬ 
clude  a  particular  representation  for  the  operations  described  by  the  axioms.  If 
the  axioms  are  shown  to  hold,  then  the  representations  are  concluded  on  the 
basis  of  mathematical  logic  without  any  requirement  for  direct  evaluation  of  the 
representations  themselves. 

Formulated  are  necessary  and  sufficient  behavioral  and  testable  condition 
from  which  the  detailed  representations,  measurement  scales,  are  derived.  The 
project  began  as  a  detailed  axiomatic  theory  of  psychophysics  based  on  the 
operations  of  matching  and  magnitude  production.  The  empirical  aspect  of  the 
approach  requires  the  primitives  of  the  theory  be  given  precise  interpretation 
in  the  given  psychophysical  domain  and  then  experimentally  evaluate  how  well 
the  behavioral  properties  are,  on  the  basis  of  data,  supported.  If  satisfactory 
support  is  found,  the  representations  are  concluded  as  descriptive  theory  for  the 
domain.  Among  the  strengths  of  this  approach  is  that  it  allows  novel  predictions 
to  be  deduced  from  the  representations  in  a  way  analogous  to  a  good  deal  of 
classical  static  physics.  An  element  of  the  work  is  thus  to  examine  the  theory 
to  derive  from  it  solutions  to  outstanding  questions  as  well  as  novel  predictions. 

Formally,  let  x,u  £  X ,  be  signals  from  a  set  of  physical  intensities  with  0 
being  the  threshold  level.  If  x  is  a  pure  tone  presented  to  the  left  ear,  then  let 
u  denote  another  such  tone  presented  to  the  right  ear  and  denote  their  joint 
presentation  as  ( x,u )  £  X  x  X,  where  the  subjective  intensity  ordering  £  is 
assumed  to  be  a  weak  order.  The  stimulus  could  equally  well  be  lights  presented 
to  the  two  eyes,  weight  in  the  two  arms,  or  any  other  binary  presentation. 

A  signal  z  is  a  matching  signal  if  (z,z)  is  judged  equally  intense  as  (x,u). 
Matching  has  the  operation  notation  x  ©  u  defined  as  x  ®  y  :=  z. 

In  ratio  magnitude  production  the  experimenter  presents  a  signal  x  and  a 
number  p  >  0  and  asks  the  respondent  to  report  the  signal  xp  such  that  the 
“interval  from  the  reference  signal  p  to  xp”  is  perceived  as  p  “times”  as  intense 
as  the  “interval  from  p  to  x” .  It  is  also  convenient  to  think  of  xp  as  an  operator: 
xp  =  x  op  p.  The  z  and  xp  are  bold  faced  to  emphasize  that  they  are  random 
variable.  When  p  =  0,  this  is  simply  S.  S.  Stevens’  (1975)  method  of  magnitude 
production. 

Luce  (2002,  2004,  2008)  presented  an  axiomatization  describing  the  two 
operation,  ®  and  op,  separately,  as  well  as  properties  linking  the  two  oper¬ 
ations,  which  ensured  that  the  same  psychophysical  function  was  involved  in 
both  operations,  establishing  a  mapping  between  the  structures  (X,  £,®,op) 
and  (R+,  >,  +,  x).  These  properties  imply  the  following  numerical  representa¬ 
tion:  A  p-additive  order  preserving  psychophysical  function  -ijj: 

ip(x  ®  y)  =  ip{x  ©  0)  +  ^(0  ®  y)  +  5ip(x  ©  0)^(0  ®  y),  6=  -1,0,1,  (1) 


And  a  weighting  function  W  over  positive  numbers  such  that 


W(p) 


ip{xp)  -  ^}{pi) 
^{x)  -  i’(Pi)  ’ 


if  p  >  1 
if  p  <  1 


(2) 


3 


Strong  support,  some  obtained  under  this  grant,  for  the  behavioral  axioms 
has  been  established  for  loudness,  brightness,  and  perceived  contrast,  focusing 
on  the  data  from  individual  respondents,  (Steingrimsson  &  Luce, 2005a,  2005b, 
2006,  2007;  Steingrimsson,  2009,  2011, 2012a, b,c).  These  experiments  demon¬ 
strate  that  we  can  successfully  apply  this  theoretical  approach  to  give  a  formal 
and  unified  description  of  the  three  domains  of  loudness,  (achromatic)  bright¬ 
ness,  and  perceived  contrast.  Furthermore,  this  description  is  unique  in  the 
sense  that  the  scales  involved  are  either  ratio  or  absolute  ones. 

In  the  representations  (1,  2),  the  two  functions  if)  and  W  figure  as  unknown 
functions.  In  the  axiomatic  approach,  invariance  properties  are  formulated  that 
if  hold  imply  certain  functional  forms.  On  basis  of  such  invariances,  we  have 
established  that  0  has  a  power  function  form  and  IT  is  a  Prelec  function  (Ste¬ 
ingrimsson  &  Luce,  2006,  2007;  Steingrimsson,  2012a). 


4  Progress  Made  &;  Results  Obtained 

The  main  results  are  grouped  into  seven  categories.  Because  some  of  this  re¬ 
search  is  a  continuation  of  previous  work,  most  detailed  are  those  areas  obtained 
under  this  grant.  Item  1  and  2  involve  direct  expansion  of  the  theory;  items 
2-6  results  that  are  in  various  ways  derived  from  the  theory  itself;  while  the 
last  time,  7,  is  an  alternative  axiom  underlying  the  p-additive  representation, 
(1),  that  simplifies  its  empirical  evaluation.  In  all  cases,  the  progress  on  the 
associated  empirical  program  is  reported. 

1.  Theory  of  Unary  Senses 

The  ears,  eyes,  and  arms  are  binary  receptors  that  function  as  cooperative 
pairs.  Had  there  had  been  a  grand  designer,  rather  than  evolution,  we  might 
have  considered  having  3  arms  and  hands — 2  to  hold  and  the  middle  one  to 
manipulate.  In  reality,  we  need  not  consider  senses  beyond  the  binary  ones, 
but  there  are  numerous  unary  ones.  Examples:  taste,  electric  shock,  vibration, 
force,  linear  extent,  preference  for  money,  etc.  A  crucial,  and  novel,  realization 
is  that  the  unary  senses  present  constraints  that  lead  to  modeling  that  differ 
from  the  binary  one.  To  separate  the  two,  we  refer  to  the  binary  cases  as  2-D 
and  he  unary  ones  as  1-D. 

Central  is  the  observation  that  for  the  2-D  cases,  the  concatenation  x  ®  u 
of  signals  is  over  two  sensory  organs,  in  which  case  the  important  operator 
property  of  commutativity,  x  ©  u  ~  u  ©  x,  is  over,  e.g.,  the  two  eyes  or  ears, 
and  that  has  empirically  generally  been  found  to  fail.  In  the  unary  theory, 
the  interpretation  is  of  simple  physical  concatenation  of  x  and  u ,  denoted  x  0 
u ,  which,  of  course,  means  that  its  physical  measure  is  just  the  sum  of  the 
two  intensities:  x  +  u.  Thus,  there  is  no  experimental  issue  about  finding  ©. 
This  case  was  modelled  in  physics  by  Holder’s  (1901)  axioms,  heavily  involving 
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commutativity  and  associativity  such  that  for  all  signals  x,  u ,  and  z, 


(3) 

(4) 


X  ©  U  ~  U  ©  X, 

(x  ©  u)  ©  2  ~  a;  ©  (u  ©  2). 

This  led,  via  Holder’s  theorem,  to  a  mapping  (-X-,  £,©)  into  (R+,  >, +).and 
a  simple  additive  representation.  In  contrast,  as  with  the  binary  theory,  we 
assume  magnitude  production  and  a  linking  axiom  between  the  op  production 
structure  and  the  ©  structure.  These  require  both  addition  and  multiplication, 
which  means  we  definitely  need  to  map  into  (R+,>,+,  x),  which  according  to 
Holder’s  (1901)  axioms  about  lead  to  three  possible,  very  distinct, 

representations,  namely, 

<p(x®y)  =  ip(x)  +  ip{y)  +  6ip{x)ip(y),  d  =  -1,0,1.  (5) 

Unlike  the  binary  case,  Luce  (2012a)  shows  there  are  three  form  for  the 
psychophysical  function  ip  corresponding  to  the  value  of  5:  For  x  €  R+ 

Po(x)  =  yx  (y  >0)  if  5  =  0 

ipi{x)  =  eXx  —  1  (A  >  0)  if  6  =  1 
ip-i(x)  =  1  —  e~KX  (k  >  0)  if  5  =  —1 

These  predictions  lead  a  somewhat  complicated  predictions  of  cross-modal  map¬ 
pings  between  domains,  which  we  have  worked  out  and  are  reported  in  Table  1 
of  Luce  (2012a) — discussion  of  these  results  in  relation  to  utility  theory  are  also 
briefly  addressed  by  Luce  (2011). 

The  extant  empirical  literature  (Stevens,  1975  summarizes)  for  unary  do¬ 
mains  defends  power  functions.  And  indeed,  for  S  =  0,  ip  is  a  special  case  of  a 
power  function,  but  for.  <5^0,  these  two  exponential  function  clearly  are  not 
power  functions;  what  gives?  It  may  simply  turn  out  that  the  extant  data  are 
not  as  indicative  of  power-functions  as  has  been  argued.  .  For  example,  Stevens 
(1959)  reported  averaged  cross-modal  matches  between  loudness  of  noise,  vibra¬ 
tion,  and  shock  each  linear  fit  to  these  data  measured  in  dB  appeared  decent, 
but  visual  evaluations  can  be  deceiving:  Stevens  did  indeed  note  them  to  be  far 
less  satisfactory  than  desired.  Our  post-hoc  fitting  with  predicted  functions  for 
cross-modal  matches  seem  to  improve  the  fits. 

By  mapping  (X,^,Q)  just  into  (ffi+,>,+)  rather  than  into  the  full  real 
numbers,  Holder  and  the  rest  of  the  field  overlooked  these  solutions  to  his  ax- 
iomatizations  (Luce,  2013b).  The  omission  did  not  matter  for  physics,  but  they 
certainly  appear  to  matter  greatly  for  the  behavioral  and  economic  sciences.  So 
it  was  not  until  the  work  carried  out  under  this  grant  that  we  fully  realized  the 
impact  of  the  incompleteness  of  Holder’s  theorem. 

2.  Cross-Dimensional  and  -Modal  Matching 

A.  Steingrimsson  and  Luce  (2007)  showed  that  a  certain  commutative  property 
proposed  by  Narens  (1996)  was  equivalent  to  measurement  on  a  ratio  scale, 
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i.e.  whether  xp  =  “p  times  x”  followed  by  xPi9  =  “q  times  xp”  is  the  same  as 
when  the  order  of  p  and  q  is  reversed  to  q  and  p.  That  is,  does  xPi9  =  x9iP? 
But,  what  if,  e.g.,  loudness  for  stimulus  fixed  at  one  frequency  was  a  ratio 
scale,  how  did  that  scale  compare  to  one  for  another  fixed  frequency?  Luce, 
Steingrimsson,  and  Narens  (2010)  generalized  the  commutativity  property  to 
showing  that  under  any  of  several  cross-dimensional  conditions,  the  property’s 
holding  was  equivalent  to  the  same  ratio  scale  obtaining  across  those  dimensions. 
They  showed  this  to  hold  empirically  for  loudness  and  pitch.  During  this  grant 
Steingrimsson,  Luce,  and  Narens  (2012)  extended  the  empirical  work  to  conclude 
that  brightness  for  different  hues  also  forms  a  single  ratio  scale. 

B.  We  realized  that  axiomatic  properties  that  could  relate  scales  of  loudness 
for  different  pitch,  could  equally  be  applied  to  cross  modal  questions  such  as 
how  scales  of  loudness  relate  to  scales  of  brightness  and  therefore  to  evaluating 
the  intriguing  prospect  of  establishing  a  single  scale  for  multiple  domains. 

C.  Luce  (2012a)  used  the  properties  of  the  additive  binary  representation 
(1)  and  p-additive  unary  (5)  ones  to  evolve  theoretical  predictions  for  cross 
modal  matches.  They  are  appreciably  more  complex  than  Stevens  (1975,  Ch.  4 
for  summary)  recognized,  but  appear  to  have  some  properties  exhibited  in  his 
data.  The  program  for  the  unary  theory  (Luce,  2012a)  as  well  as  the  increased 
complexity  of  cross-modal  matching  predictions  presents  a  considerably  enlarged 
empirical  program  than  previously  envisions  that  must  be  executed  to  answer 
the  proposition  in  point  B. 

3.  Torgerson’s  (1961)  conjecture 

Torgerson’s  (1961)  conjectured  that  respondents  fail  to  distinguish  subjective 
differences  from  subjective  ratios.  Luce  (2012b)  showed  that  when  applied  to 
eqisections  and  fractionation  that  the  conjecture  implies  that  the  function  W 
in  the  production  representation  (2)  is  the  identity  function.  That  W  is  the 
identity  function  is  firmly  rejected  by  existing  data  (Ellcrmeier  &  Faulhammer, 
2000;  Steingrimsson  &  Luce,  2005b;  Steingrimsson,  2011,  2012c;  Zimmer,  2005). 
Yet,  direct  evaluations  of  the  conjecture  have  produced  mixed  results,  some 
favoring  it  and  others  rejecting  it.  Why  might  this  be? 

Most  empirical  efforts  have  relied  on  stimuli  which  when  measured  in  phys¬ 
ically,  “ought  to”  allow  respondents  to  make  distinct  evaluations  of  differences 
and  ratios.  However,  when  these  stimuli  are  evaluated  in  psychological  space 
using  plausible  instantiations  of  the  summation  representation  (1),  these  stimuli 
appear  in  many  cases  to  be  ill  chosen.  On  the  basis  of  the  psychophysical  model, 
we  evaluate  the  conjecture  in  several  different  ways.  Using  auditory  stimuli,  we 
decided  to  try  having  respondents  tell  us  qualitatively  their  perceptual  order 
between  pairs  of  signal  according  to  differences  and  ratios,  which  according  to 
Torgerson  they  are  unable  to  do;  in  brightness,  we  asked  respondents  to  adjust 
intensity  of  illuminated  squares  to  agree  either  in  equality  of  differences  or  in  ra¬ 
tios.  In  all  cases,  the  choices  of  stimuli  and  prediction  of  responses  are  informed 
by  our  model.  These  predictions  were  formed  on  the  basis  of  the  representations 
(6,  7),  detailed  under  the  next  item. 
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The  evidence  shows  respondents  give  differential  responses  consistent  with 
both  ratio  and  differences  judgments  but  they  do  so  based  on  individual  and 
task  dependent  strategies  that  are  seemingly  unaffected  by  ratio/difference  in¬ 
structions.  These  data  thus  suggest  a  refinement  of  the  conjecture  to  include 
the  ability  to  perform  both  ratio  and  difference  judgments  but  that  respondents 
cannot  surface  these  judgments  simultaneously.  Rather,  they  shift  between  the 
two  scenarios  in  a  yet  unspecified  manner  (Steingrimsson  &  Luce,  2012b).  A 
happy  consequence  of  this  seemingly  novel  characterization  of  the  conjecture  is 
that  the  data  appear  to  simultaneously  give  a  plausible  account  for  the  theoret¬ 
ically  derived  evidence  against  the  Torgerson’s  (1961)  conjecture  based  on  the 
W  function  (Luce,  2012b)  as  well  as  of  the  inconsistency  in  the  empirical  record. 

4.  The  scales  of  ratings  scales 

The  use  of  ratings  on  a  scale  of  {a,  a  + 1,  a  +  2, ...,  n},  a  €  Z  to  evaluate  “magni¬ 
tude”  are  ubiquitous  in  psychology.  Examples  include  pain  in  medical  settings, 
service  satisfaction  in  a  restaurant,  effect  of  medication  on  depression,  and  so 
on.  Yet,  despite  wide-spread  use,  no  principled  analysis  of  the  underlying  scale 
of  these  ratings  has,  to  our  knowledge,  appeared  in  the  literature. 

Rating  scales,  such  as  {1,  2,  3,4,  5}  or  {0, 1, 2, ....,  10},  are  dimensionless,  and 
so  they  are  really  not  compatible  with  ratio  or  ordinal  scales.  Were  they  ratio 
scales  it  should  be  equally  satisfactory  to  use  {2.8,5.6,8.4,11.2,14}  instead  of 
{1,  2, 3, 4,  5}  and  were  they  ordinal,  the  ratings  of  {1, 2, 3, 4,  5}  should  yield  the 
same  conclusions  as  their  cubes  {1,8,27,64,125},  neither  of  which  we  expect. 
Whether  ratio  or  ordinal  is  assumed  (often  simply  implicitly  by  way  of  a  choice 
of  statistical  tool),  then  the  researcher  feels  it  appropriate  to  use  statistical 
inferences  appropriate  to  that  level  of  scale  type,  which  simply  is  not  really 
justified  just  by  declaration. 

We  suggest  that  when  the  respondent  is  asked  to  scale  something  using  the 
ratings  {0, 1,  2, ....,  10},  they  are  using  these  integers  simply  to  count.  That  is 
not  a  case  of  scaling  in  the  usual  measurement  sense.  Of  course,  that  leaves  it 
entirely  up  to  the  respondent  “to  decide”  what  to  count.  If  this  view  is  correct, 
it  means  ratings  are  on  absolute  scales  giving  no  more  trouble  using  standard 
statistical  measures  such  as  the  mean,  standard  deviation,  median  than  there  is 
with  any  other  counts.  For  example,  a  farmer  may  want  to  estimate  how  many 
oranges  will  end  up  in  a  box.  So  the  farmer  counts  of  the  numbers  of  oranges 
in  several  identical  boxes  and  computes  the  mean  and  standard  deviation,  but 
no  one  would  claim  that  the  mean  count  should  be  viewed  as  a  ratio  scale  of 
anything.  It  is  a  simple  fact  of  counting. 

The  next  question  is,  in  a  rating  scale  procedure,  what  is  it  that  the  re¬ 
spondent  is  counting?  The  answer  is  not  obvious.  We  propose  two  hypotheses 
each  of  which  needs  to  be  evaluated  experimentally.  We  wish  to  explore  the 
very  general  conjecture  that  when  asked  to  evaluate  intensity  on  a  rating  scale, 
the  respondent  establishes  in  his  or  her  mind  an  ordered  partition  of  subjective 
intensity,  and  the  count  is  to  report  which  partition  includes  the  current  signal. 
Suppose  a  respondent  is  asked  to  rate  on  a  scale  of  {1,  2, ...,  n}  signals  from  the 
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reference  “interval”  from  x  to  y ,  where  x  <  y.  Then,  the  ratings  will  form  a 
distribution  from  which  the  subjective  category  boundaries  can  be  estimated. 
Based  on  the  psychophysical  model  as  well  as  the  work  on  the  Torgerson’s 
conjecture,  two  different  possible  representations  present  themselves.  The  first 
predicts  equally  spaced  intervals,  i.e. ,  when  the  respondent  is  asked  to  partition 
the  interval  (x,  y)  into  n  subjectively  “equal”  intervals,  then  the  interval  of  the 
first  m  <  n  of  these  equal  ones,  called  emjH  must  satisfy 


(n  -  m)  (t/)(em,„)  -  i>{x))  =  m  {ijj(y)  -  ^(era,„))  (6) 

TYl 

o  ip(em,n)  =  —  [4>(y)  -  V’fr)]  + 

n 

Replacing  ip{x)  =  ax13 ,  we  have 


The  other  is  of  equal  ratios,  i.e.,  when  the  respondent  is  asked  to  report  the 
signal  rmtn  that  divides  the  interval  (x,  y)  so  that  (x,  rm>n)  is  in  the  “ratio”  — 
to  (x,  j/),  then  we  assume  that  this  means 


w  =  V>(rm,n)  -  ^(x) 

V  n  )  ip(y)  —  il>( x ) 

Replacing  ip( x)  =  ax13  and  solving  for  rm^n 


(7) 


V(^y  +  (i-w(^))**' 


1//3 


Data  need  to  be  collected  to  evaluation  which  of  these,  if  either,  best  describe 
the  data.  Pilot  data  have  been  collected. 


5.  Non-Equal  Matches 

When  respondents  match  one  signal  to  another  signal  they  do,  in  fact,  not  usu¬ 
ally  produce  same  physical  signal.  We  call  these  non-equal  matches  (NEM). 
Special  cases  include  the  time-order  error  (TOE)  in  audition  and  the  space- 
order  error  in  vision.  Steingrimsson  and  Luce  (2012a)  collected  extensive  data 
on  NEM  using  a  certain  version  of  the  method  of  adjustment  (MA)  and  these 
data  accorded  well  with  (2).  Psychologists’  often  used  2IFC  method  yielded 
data  that  are  far  less  coherent  than  those  from  MA  run  in  blocks.  For  example, 
the  MA  theory  predicts  either  of  two  very  different  patterns  of  behavior,  and 
our  respondents  divided  equally  between  the  two  patterns.  That  seems  to  have 
important  implications  about  the  methodology  of  psychological  experiments  in 
general.  In  particular,  one  should  not  average  over  respondents — there  are  im¬ 
portant  individual  differences. 


6.  Regression  Effect 


The  regression  effect  is  the  name  Stevens  gave  to  the  fact  that  linear  equations 
relating  signal  intensity  to  magnitude  estimation  and  to  magnitude  production 
do  not  quite  agree.  On  the  assumption  that  magnitude  estimation  satisfies  the 
same  representation  (2)  but  with  x ,  u  the  two  given  signals  and  p  the  respondent 
determined  measure,  we  derived  a  simple  theoretical  account  of  the  regression 
effect,  which  in  the  context  of  (2)  asks  whether  it  does  hold  for  magnitude 
estimation.  This  again  is  both  a  theoretical  question  (Luce,  2013a)  and  an 
empirical  question,  which  has  not  yet  been  investigated. 

7.  Conjoint  Commutativity 

The  empirical  study  of  the  axioms  underlying  the  conjoint  additive  representa¬ 
tion  (1)  initially  focused  mostly  on  the  double  cancellation  axiom.  That  axiom 
contains  redundant  features  that  make  its  evaluation  a  major  challenge.  The 
special  case  of  double  cancellation,  the  Thomsen  condition,  was  shown,  in  the 
full  axiomatic  context  to  be  equivalent  to  the  double  cancellation  property  ab¬ 
sent  the  undesirable  redundancies.  Therefore,  in  our  empirical  evaluation  of  the 
theory  in  loudness  and  brightness,  we  studied  that  property.  However,  although 
we  found  substantial  support  for  the  it,  we  observed  certain  problems  in  its 
empirical  realization,  as  had  some  previous  researchers  before  us  (Gigerenzer 
&  Strube,  1983).  We  concluded  the  trouble  stemmed  from  unequal  number  of 
compound  estimate  involved  in  the  estimated  signals  subjected  to  statistical 
evaluation  (a  major  culprit  is  the  phenomenon  of  NEM  reported  in  item  5).  We 
showed  that  the  property  of  conjoint  commutativity,  first  proposed  by  Falmagne 
(1976),  who  called  it  the  commutativity  rule,  is  equivalent  to  the  Thomsen  con¬ 
dition,  a  result  that  seems  to  have  been  overlooked  in  the  literature.  We  detailed 
the  issue  and  subjected  this  property  to  empirical  evaluation  for  both  loudness 
and  brightness  (Luce  &  Steingrimsson,  2011).  Subsequently  we  evaluated  only 
conjoint  commutativity  for  perceived  contrast  (Steingrimsson,  2012b).  In  all 
cases,  the  data  strongly  supported  the  conjoint  commutativity  and  thereby  con¬ 
joint  additivity  in  these  domains. 


5  Significance  of  Results  &  Impact  on  Science 

This  Grant  has  enabled  the  extension  of  the  theory  described  in  Section  3,  ad¬ 
dition  to  the  already  strong  evidence  for  the  behavioral  invariances  underlying 
the  theory  described  in  Section  3,  and  extension  of  this  evaluation  to  an  ad¬ 
ditional  domain.  A  number  of  predictions  from  the  theory  have  been  derived. 
All  predictions  that  have  so  far  been  tested  have  been  sustained.  Continuing 
projects  have  been  outlined. 

The  extension  of  the  2-D  theory  to  include  the  1-D  modalities  provides  a 
complete  psychophysical  theory  of  global  percepts  of  intensive  dimensions.  Prior 
and  novel  empirical  work  provides  strong  support  for  the  2-D  theory  across  three 
separate  domains  and  extant  data  lends  plausibility  to  the  1-D  theory,  which  in 
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turn  is  associated  with  a  clear  empirical  program.  This  results  is  important  in 
three  major  ways.  First,  it  is  a  complete  theory  of  intensive  dimensions,  which 
by  virtue  of  being  domain  independent,  and  verified  or  in  principle  verifiable 
in  all  intensive  domains,  provides  a  unified  formal  description  for  this  area  of 
human  perception.  Second,  the  theory  captures  both  the  general  behavior  of 
(normal  and  healthy)  people  while  explicitly  specifying,  on  the  level  of  param¬ 
eters,  how  individual  behavior  may  vary.  Consequently,  the  data  are  analyzed 
and  evaluated  on  individual  bases  rather  than,  as  is  a  frequent  practice  in  psy¬ 
chology,  averaging  over  individual  respondents.  The  latter  is  important  because 
unless  individual  scale  properties  are  identical,  which  has  been  soundly  rejected 
empirically,  the  practice  of  averaging  over  respondents  limits  generalizability  of 
results  to  a  population,  and  in  many  cases  is  not  mathematically  justified  treat¬ 
ment  of  the  data.  Third,  the  theory,  by  way  of  its  axiomatic  approach,  has  the 
feature  that  the  behavioral  properties  (axioms)  themselves  can  be  separately 
evaluated  empirically  and  if  found  to  hold  lead  to  the  representations  by  way 
of  mathematical  logic.  There  is  thus  no  need  for  fitting  data  to  the  model  or 
empirically  evaluating  the  representations  themselves.  This  approach,  typical 
for  classical  statistic  physics,  to  psychophysics  is  novel  in  its  scope  and,  as  is 
the  case  for  physics,  possesses  considerable  power  to  arrive  at  derived  account 
for  numerous  outstanding  problems  as  well  as  novel  predictions.  It  is  possible 
that  we  have  but  scratched  the  surface  of  what  the  results  are  telling  us. 

Stimuli  can  often  vary  on  more  than  one  dimension.  We  have  extended  the 
theory  to  evaluate  scale  properties  of  one  while  another  is  varied.  This  has 
allowed  us  to  demonstrate  that  a  single  ratio  scale  obtained  for,  e.g.,  loudness 
varying  in  pitch  and  brightness  varying  in  hue.  More  of  these  evaluations  remain 
to  be  carried  out.  A  particularly  intriguing  possibility  is  that  a  single  ratio  scale 
obtains  for  two  or  more  domains  and  we  have  laid  out  the  empirical  program 
needed  to  answer  this  question.  This  program  is  simple  in  the  case  of  2-D 
domains,  but  due  to  novel  predications  for  the  psychophysical  functions  for  the 
1-D  domains,  cross-modal  predictions  between  the  2-D  and  1-D  domains  are 
unexpectedly  complex  requiring  substantial  new  data.  The  empirical  program 
needed  for  this  evaluation  outlined  in  detail.  At  the  completion  of  this  program, 
we  will  be  able  give  a  complete  answer  to  questions  about  the  scale  properties 
of  (in  principle)  any  intensive  dimension,  both  as  it  varies  on  other  dimensions 
within  a  modality  as  well  as  how  those  scales  relate  across  modalities. 

The  theory  established  the  necessary  and  sufficient  axioms  for  concluding 
ratio  scale  measures  using  magnitude  estimation/production  (ME/P).  One  aim 
of  the  grant  was  to  extend  methodological  axiomatization  beyond  ME/P,  and 
in  particular  to  ratings.  The  paradigm  of  ratings  is  particularly  ubiquitous  in 
psychology,  even  as  the  scale  properties  of  the  obtained  data  are  poorly  es¬ 
tablished  and  the  results  are  treated  in  ways  implying  unformulated  and  poorly 
founded  assumptions.  The  primary  goal  is  to  establish  the  scale  type  of  the  data 
collected  by  these  methods  and  thereby  explicitly  establish  which  statistics  are 
justified.  This  work  is  well  advanced  and  when  completed  promises  to  vastly 
improve  the  tools  psychologists  use  for  collecting  and  analyzing  data  obtained 
using  ratings. 
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The  issues  related  to  scale  properties  of  data  collected  by  various  methods 
are  linked  to  some  long-standing  question.  For  instance,  the  Torgerson’s  con¬ 
jecture  (that  respondents  do  not  distinguish  between  judgments  of  the  ratios 
of  sensations  and  their  differences),  is  a  question  that  turns  out  to  matter  sub¬ 
stantially  in  the  establishment  of  measurement  scales,  yet  has  so  far  has  resisted 
theoretical  and  empirical  conclusive  resolution.  Our  derived  prediction  from  the 
core  psychophysical  theory  turned  out  to  address  this  question  in  an  unexpected, 
but  successful,  way. 

Another  matter  of  importance  is  the  well-known  effect  of  temporal  and/or 
spacial  stimulus  presentation,  which  we  call  non-equal  matches  (NEM).  This 
effect  is  often  thought  of  as  context  effects  that  can  be  “averaged  away”.  We 
derived  an  account  of  this  effect  from  our  psychophysical  theory  and  showed 
that  an,  a  priori,  assumption  about  how  to  control  for  it  by  averaging  is  not 
a  sound  approach  and  that  the  effect  can  vary  by  sufficiently  from  one  data 
collection  methods  to  affect  the  conclusions  drawn  from  the  resulting  data. 

Evaluating  the  impact  of,  for  instance,  the  NEM  alerted  us  to  the  need  to 
examine  both  our  existing  data  as  well  as  the  underlying  behavioral  properties 
of  the  theory.  This  analysis  led  us  to  seek  an  alternative  for  the  Thomsen 
condition,  a  property  central  to  conjoint  additivity,  namely  the  property  of 
conjoint  commutativity  that  from  an  empirical  stand  point  is  superior  to  the 
Thomsen  condition. 

The  theory  itself  is  mathematically  explicit,  formal  description  of  individ¬ 
ual  behavior.  This  provides  possible  applications  which  are  computational  in 
nature.  In  our  work  on  the  Torgerson’s  conjecture,  we  used  plausible  instantia¬ 
tions  of  the  parameters  of  the  theory  to  choose  stimuli  that  were  appropriate  in 
the  psychological  context  and  to  show  that  stimuli  used  in  prior  research  that 
seemed  to  be  appropriate  based  on  their  physical  context,  were  decidedly  not  so. 
This  is  one  instance  of  possible  application  of  the  computational  aspect  of  the 
theory.  Any  number  of  other  applications  involving,  e.g.,  computer  simulation 
of  psychophysical  properties  seem  plausible. 

If  it  turns  out  the  idea  of  a  single  subjective  intensity  scale  over  some  inten¬ 
sive  continua  appears  to  hold,  it  suggests  the  practical  possibility  of  using  one 
easily  manipulated  measure  as  a  proxy  for  another  much  more  difficult  measure 
or  to  manipulate.  Can  we  use  that  to  improve,  in  practice,  the  measurement  of 
pain  in  terms  of  something  easily  controlled  such  as  loudness,  as  has  recently 
been  proposed  by  Bartoshuk  (2010)?  The  major  problem  here  is  that  for  in¬ 
herently  1-D  attributes,  the  p-additive  patterns  of  item  1,  Section  4,  need  to 
be  thoroughly  studied — a  very  long  term  project.  We  do  not  know  for  sure 
which  1-D  attributes  correspond  to  S  =  1,0, —1  and  which  can  be  successfully 
matched. 
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